Integrated Correction of Ill-Formed Sentences
Authors
Abstract
This paper describes a system that performs hierarchical error recovery: it detects and corrects a single error in a sentence at the lexical, syntactic, and/or semantic level. If the system cannot repair an erroneous sentence on the assumption that it contains a single error, a multiple error recovery system is invoked. The system employs a chart parsing algorithm with an augmented context-free grammar and has subsystems for lexical, syntactic, surface case, and semantic processing, which are controlled by an integrated-agenda system. In the frequent case that several repairs are possible, they are ranked by penalty scores. The penalty scores are based on grammar-dependent and grammar-independent heuristics. The grammar-independent heuristics involve error types and, at the lexical level, character distance. The grammar-dependent heuristics involve, at the syntactic level, the significance of the repaired constituent in a local tree and, at the semantic level, the distance between the semantic form containing the error and normal act templates. This paper focuses on single error recovery.
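The lexical-level ranking mentioned in the abstract can be illustrated with a minimal sketch. It assumes that "character distance" can be approximated by Levenshtein edit distance and that each error type carries a fixed penalty. The error-type names, the weights, and the names `Repair`, `penalty`, and `rank_repairs` are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass

# Hypothetical per-error-type penalties; the abstract gives no actual values.
ERROR_TYPE_PENALTY = {"substitution": 1.0, "insertion": 1.5, "deletion": 1.5}


def char_distance(a: str, b: str) -> int:
    """Levenshtein edit distance, used as a stand-in for 'character distance'."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


@dataclass
class Repair:
    original: str      # the word as it appears in the ill-formed sentence
    replacement: str   # the candidate correction
    error_type: str    # key into ERROR_TYPE_PENALTY (assumed taxonomy)


def penalty(repair: Repair) -> float:
    """Combine the error-type penalty with the character distance."""
    return (ERROR_TYPE_PENALTY[repair.error_type]
            + char_distance(repair.original, repair.replacement))


def rank_repairs(repairs: list[Repair]) -> list[Repair]:
    """Return candidate repairs ordered from lowest to highest penalty."""
    return sorted(repairs, key=penalty)


candidates = [
    Repair("sentense", "sentences", "substitution"),
    Repair("sentense", "sentence", "substitution"),
]
best = rank_repairs(candidates)[0]
```

In this sketch the candidate with the smaller edit distance ranks first when the error types are equal. The grammar-dependent heuristics at the syntactic and semantic levels would need information from the parser and are not modeled here.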
Similar resources
Syntactic Recovery and Spelling Correction of Ill-formed Sentences
This paper describes syntactic repair and spelling correction, within a context-free grammar using non-static filtering, of ill-formed sentences which violate subject-verb agreement or premodifier-noun agreement. The system described here provides recovery of local trees, reconstruction of the sentence, and spelling correction of detected typographical errors. It also prod...
Automatic grammar correction for second-language learners
A computer conversational system can potentially help a foreign-language student improve his/her fluency through practice dialogues. One of its potential roles could be to correct ungrammatical sentences. This paper describes our research on a sentence-level, generation-based approach to grammar correction: first, a word lattice of candidate corrections is generated from an ill-formed input. A ...
Judging Grammaticality: Experiments in Sentence Classification
A classifier which is capable of distinguishing a syntactically well formed sentence from a syntactically ill formed one has the potential to be useful in an L2 language-learning context. In this article, we describe a classifier which classifies English sentences as either well formed or ill formed using information gleaned from three different natural language processing techniques. We descri...
Error recovery for robust language understanding in spoken dialogue systems
In this paper, we proposed an example-based approach aiming at recovering ill-formed inputs to improve robustness of spoken dialogue systems. In this approach, a treebank, which contains example sentences and their correct parse trees, is used to provide clues for fixing the errors of ill-formed inputs. Particularly, the proposed error recovery method is suitable for spoken dialogue application...
Yet Another Chart-Based Technique for Parsing Ill-Formed Input
A new chart-based technique for parsing ill-formed input is proposed. This can process sentences with unknown/misspelled words, omitted words or extraneous words. This generalized parsing strategy is, similar to Mellish's, based on an active chart parser, and shares the many advantages of Mellish's technique. It is based on pure syntactic knowledge, it is independent of all grammars, and it doe...
Publication date: 1997